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Abstract 



We introduce superposition-based quantum networks composed of (i) 
. the classical perceptron model of multilayered, feedforward neural net- 

Q | works and (ii) the algebraic model of evolving reticular quantum struc- 

* • tures as described in quantum gravity. The main feature of this model 

is moving from particular neural topologies to a quantum metastruc- 
ture which embodies many differing topological patterns. Using quan- 
tum parallelism, training is possible on superpositions of different net- 
. work topologies. As a result, not only classical transition functions, 

^sO \ but also topology becomes a subject of training. The main feature of 

our model is that particular neural networks, with different topolo- 
gies, are quantum states. We consider high-dimensional dissipative 
quantum structures as candidates for implementation of the model. 
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^ ■ Quantum learning networks have been suggested to offer new domains for 

quantum algorithm design [21 EJ El] • Machine learning- inspired architectures 
are self-organizing, robust, and ideal for such tasks as pattern recognition 
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and associative processing. Contemporary quantum network models convey 
their advantage using superposed quantum states on a fixed topological back- 
ground. We suggest superposed quantum topologies as a novel approach to 
quantum neural networks, and present a model of learning and evolving su- 
perposed quantum network topologies, or SQNTs. The mathematical basis 
of this model is predicated upon two existing formalisms: 

• Neural networks: the classical learning model of pattern recognition 

urn 

• Algebraic quantum foam model: the formalism which describes super- 
posed and continuously evolving discrete structures [TTJ EH] 

In order to show how our model can be implemented we consider multidi- 
mensional quantum systems admitting highly degenerate states as described 
in [T3j and the Bose-Einstein condensate as outlined in jTTj. 

An overview of the model is arranged as follows. We begin with a classical 
perceptron model, namely, that of a multilayered feedforward, weakly con- 
nected neural network. Then we extend this model, making it quantum by 
admitting the existence of superpositions of differing topological structures 
of neural networks using Rota algebraic formalism (201 as applied to describe 
the evolution of reticular patterns of quantum spacetimes. This means that 

1. We admit that the topological structure of a perceiving entity can 
change 

2. We admit that these changes may be continuous. 

At first sight these two requirements look contradictory: how can discrete 
structures evolve continuously? These requirements are reconciled in quan- 
tum mechanics. Along these lines we present the model of superposition- 
based quantum network topologies. 

1 Interlude on reticular quantum spacetime 
formalism 

Before coming to SQNT models in more detail and for the sake of self- 
consistency, we recall the necessary constructions from quantum mechan- 
ics used to describe reticular quantum spacetime. The main feature of the 
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quantum mechanical description of a physical system is that we pass from 
configuration space to a complex linear space, called the state space of the 
system. We shall consider systems with finite configuration spaces, therefore 
their state spaces will be finite-dimensional Hilbert spaces H = C N . Sup- 
pose we have a system whose configuration space is {1, . . . , n}. According to 
quantum mechanics, it can be in pure (that is, dispersion free) superposed 
state such as 



Quantum observables, that is, measuring apparata are described by self- 
adjoint operators in the state space of the system in question. The values 
of the observables are the eigenvalues of the operators; they are always real. 
Note that mutually commuting (thus having the same eigenvectors) observ- 
ables are associated with the same measuring apparatus - they are intercon- 
verted just by relabelling the pointer's values. To specify an observable K, 
we consider a decomposition of the unit operator by orthogonal projectors 
and associate a number with each projector. In Dirac notation this reads: 



Spatialization. The spatialization procedure was developed in ^T] for the 
purpose of describing spacetime foam. Its primary feature is to associate 
discrete structures, rather than numerical values ©, with subspaces of the 
state space. More specifically, in standard quantum mechanics the state 
space is defined as a Hilbert space - that is, a complex linear space with an 
inner product, H x H — ► C. Each subspace of H can be associated only with 
its dimension as an integer. We associate with each subspace C a disjoint 
graph, whose number of vertices equals its dimension, dim C. 

The spatialization procedure links some of the vertices, thus associating the 
subspace C with richer structure than a cardinal number. The only require- 
ment is for 7i to be endowed with an associative product structure, rather 
than an inner product. In some cases this structure already exists - for in- 
stance, when 7i is a tensor product of two copies of the same state space H, 
that is 7i = H <g> H. The associative product is defined on factorable vectors 
as 



\tp) = cos a 1 1) + e i0 sina|2) 



(1) 




(2) 
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and then extended by linearity (this is nothing but a usual matrix product). 
The spatialization procedure is briefly outlined as follows: with any directed 
acyclic transitive graph G with N vertices its Rota algebra is associated 
A, whose elements are N x N matrices of the following form. They have 
zero entries a^. when the vertices i, k are not connected in the graph G. For 
instance, consider the graphs 




For these examples, the appropriate Rota algebras take the following form 
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(4) 



For a detailed account of Rota algebras and the spatialization formalism 
supplied with examples the reader is referred to Appendices A, B and |11| I2U]. 

Neural metastructures. The main feature of the proposal put forward 
in this paper is to shift from a particular structure of neural network as per- 
ceptron to something different: a metasystem whose states are perceptrons. 
This is accomplished as follows. Superposed quantum neural networks are 
themselves linear spaces rather than neural networks. With each subspace 
of the state space of SQNT - which is its quantum property, see (J2J above - 
a particular neural network configuration is associated using the above men- 
tioned spatialization procedure. In section El we suggest to use the quantum 
features of SQNTs to enhance the performance of the main task in neural 
networks, namely, their training. 

Realistic physical candidates to implement SQNTs have already been pro- 
posed. The first is based upon the Bose-Einstein condensate, on which a 
quantum brain model has been suggested by G. Vitiello [T7]. A second 
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model has been suggested by G. Savvidy [Hj. Both have the crucial prop- 
erty needed for our purposes, namely they have many individual degrees of 
freedom and admit superpositions. We emphasize that no ab initio associa- 
tion of states with graphs is needed, as graphs are automatically produced 
as a consequence of the spatialization procedure. SQNT implementation 
can also be modelled classically: as demonstrated in ^H], a broad class of 
quantum algorithms can be simulated on classical systems. 

2 Neural networks and Rota algebras 

One of the basic tasks of neural networks is to function as perceptrons, that 
is to recognize signals for which we have no structural theory - for instance, 
to recognize visual patterns. In this section we review the basic principles on 
which perceptrons are based, their learning and training in classical setting 
and show how Rota algebras introduced above emerge in their description. 
We shall deal with multilayered feedforward NNs, such as 



that is, their nodes can be arranged in layers so that (i) no nodes in a given 
layer communicate and (ii) the signals propagate only consecutively via lay- 
ers. The state of a given neural network is the set of values, numerical or 
vectorial, assigned to its nodes. A time-step propagation changes the per- 
ceptron's state so that the value of a node i at time t + 1 depends on its own 
value and the values of nodes adjacent to % at time t. 

Besides their multilayered, high-dimensional structure the filtering and rec- 
ognizing power of neural networks is based on the nonlinearity of their tran- 
sition functions. In some cases, in particular while dealing with stability or 
elasticity issues, we may use linear approximations. This is exactly the case 
we explore in this paper: optimal topological configurations are assumed to 
be robust under small permutations jB]. 
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Training. Initially one starts with a set of patterns for which the clas- 
sification is known. Usually by means of heuristic methods, the topologi- 
cal structure of the network is chosen and then trained via input of known 
patterns and subsequent adjustment of network parameters (transition func- 
tions). Output signals are correlated with patterns from different classes to be 
well-separated with respect to certain criterion. The most popular method to 
adjust transition functions is error backpropagation (see, for instance, [T3|). 

Performance. Signal propagation in the linear approximation can be viewed 
as a matrix multiplication which reduces to a number of arithmetic opera- 
tions. The more links there are between neurons, the more computational 
resources are consumed by the process of pattern recognition. In order for a 
neural network to be faster, we should seek for sparser configurations. 

Therefore, the criterion for 'good matching' should also take performance 
into consideration. As an example we consider the approach of fast neu- 
ral networks, which are weakly connected multilayered feedforward neural 
networks jE] . The idea of these networks grows from the FFT - fast Fourier 
transform. When finding optimal configurations of such networks, we restrict 
the allowed links between neurons and force networks to be sparse. Note that 
in our setting this requirement is followed automatically as we are confined 
by the dimensionality of the state space 7i. 

3 Superposition-based training 

The basic proposal of this paper is to replace training of particular neural net- 
work configurations with training of superposed ones. This can be achieved 
in the following steps: 

(i) SQNT is treated as a system described by quantum mechanical formalism, 
in our special case represented by a matrix algebra A. Note that the term 
'quantum' here is a mere indication of the rules of behavior of the object on 
which the SQNT is implemented, whatever be its 'real' nature. 

(ii) Input signals are vectors from a representation space of the algebra A, 
as are output signals. Connection weights are represented by the matrix 
elements, and can further be represented by matrices of linear operators 
consequent of the spatialization procedure. 
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(iii) Learning is a matching algorithm which selects a subset C of A for which 
the criterion of matching between input and output signals is optimal. At 
this step we obtain the optimal configuration on the metalevel. 

(iv) In order to pass to a particular neural network structure, we consider a 
subalgebra (rather than a subset) of A which is the nearest to the subset C. 

Now let us dwell on the above issues. For the first step, an appropriate 
physical system for SQNT to be implemented should be found. The main 
requirement which it has to satisfy is to possess sufficiently many degrees 
of freedom with controlled access to them. The initial candidates for these 
purposes are entangled quantum registers on which quantum computers are 
based. An alternative model can be considered using the Bose-Einstein con- 
densate [T7] . 

So, we start with a quantum system S with sufficiently many degrees of free- 
dom. To implement (ii), we must draw the distinction between the classes 
of states treated as states of the input register and between the states re- 
sponsible for the operations of multiplication. This step does not affect the 
overall performance of the future perceptron, as it has to be done only once 
(this is analogous to building a computer). The work of the SQNT is to 
evolve according to a prescribed evolution, starting from the initial state (= 
a pattern to recognize) to the final one (= its classification identifier). 

An algorithm seeking for best matching (iii) can be realized as follows. We 
feed in a pattern from a given sample set (= given prescribed state) and then 
let it evolve. The evolution is set in such a way that it realizes the multipli- 
cation operation. To be more precise, we consider any available evolution U t 
and then label the states of SQNT in such a way that for the unit time St the 
resulting action of U$t would be multiplication of appropriate elements. This 
is possible due to the arbitrary nature of labeling of eigenstates of operators 
(see section [IJ. 

The condition for learning to be successful is that the input vector from a 
sample set should be unchanged. In physical terms, this means patterns from 
the sample set should be eigenvectors for the energy operator responsible for 
the evolution. Why do we require it? The point is that we would like our 
output signals to be well separated - that is, they should be stable with 
respect to small variations of the input vector. This, in turn, requires the 
states of the overall register (input, output) to be orthogonal. If they are 



Superposed Quantum Networks 



8 



eigenstates of the generator of the evolution, this holds automatically. We 
denote by C the appropriate eigenspace and call it calculation subspace. 

The last item (iv) is a kind of 'classical correspondence principle' in quantum 
mechanics. SQNT operates as it is - namely, it undergoes quantum evolution 
- but if we would like to represent (perhaps with some losses) its work in 
terms of neural networks, we proceed in the following way. We deal with 
the calculation subspace C of our state space, which we obtained as a result 
of the best matching algorithm, and we have the evolution operator in our 
disposal which is interpreted as multiplication. 

The spatialization procedure can be applicable to C only if it is closed 
with respect to the multiplication, while this requirement may not hold for 
eigenspaces of the energy operator. That is why, following the lines of quan- 
tum mechanics, we consider the nearest subalgebra to C and immediately 
interpret it as a neural network. This suggestion is in full accordance with 
quantum measurement theory, where the wavefunction of the system imme- 
diately 'collapses' to an eigenstate of the appropriate operator. 

4 Concluding remarks 

Presently only a handful of quantum algorithms exist, and these are confined 
to a limited set of specialized applications. Quantum learning architectures 
offer the potential to expand this domain to a much broader class of function- 
ality. We have outlined the fundamentals of a quantum fast-training pattern 
recognition model - superpositional quantum network topologies, or SQNT - 
which provides a rich source for the development of a novel class of quan- 
tum algorithms. The distinguishing feature of our model lies in its ability to 
utilize coherent superposition of unique topological configurations of neural 
networks. 

We suggest two ways to implement our model in physical media based on 
high-dimensional dissipative quantum systems |T%1 ITT] . Quantum simulation 
methods offer an immediate candidate for study of the model: non-classical 
dynamics can be efficiently simulated on ensembles of states of classical Tur- 
ing machines ruled by second-order differential equations, see ^H] for more 
detail. It should be mentioned that although macroscopic physical systems 
are comprised of quantum components, quantum phenomena in the macro- 
scopic realm are usually invisible due to averaging over a large number of 
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degrees of freedom. Nevertheless, the existence of macroscopic systems ex- 
hibiting quantum properties cannot be neglected in many subdomains of the 
life sciences, especially in molecular biology pp. For example, A. S. Davydov 
in his book 'Biology and Quantum Mechanics' [5] studied collective soliton 
excitations in large protein molecules and applied them to such biological 
phenomena as membrane transport, nerve impulse conduction, and muscle 
contraction. Extending beyond relevance to the current model, we suggest 
this could be a potential explanation of the surprisingly high adaptation 
capabilities of living organisms. Classical models of learning and response 
to continuously changing environments require immense computational re- 
sources. We interpret this cLS cL SI gnal that perhaps quantum models could 
be more efficient for this purpose. 

Acknowledgments. The authors highly appreciate the attention to their 
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carried out within the ATIP Quantum Information Science and Technology 
Project, the project G. 0335. 02 of the Flemish Fund for Scientific Research 
and the research program 'Quantum Theory in the Early Universe, ' under 
the auspices of the Russian Basic Research Foundation (RFFI). 
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Appendix A: Rota algebras of transitive graphs 

Rota algebras in Dirac notation. Let X be a reflexive transitive graph. 
For brevity a pair of nodes i,j of X connected by an arrow i — ► j is said to 
be tending. Consider the linear space Q whose basis is labelled by 

tending pairs i — > j of nodes of X. 

Q(X) = | 1001 such that i - j I (5) 

In the sequel, when no confusion occurs, we omit the notation of the graph 
X in parentheses and simply write 

n = n(x) 

Define the product on Q by setting it on its basic elements: 

\m\w\ = { If 1 ; li=f e (e) 

Note that |^)(/| in © is always well-defined since the graph X is assumed 
to be transitive, that is why the existence of darts i — > j and j — >• k always 
enables the existence of % — > k. The space Q with the product © is called 
the Rota algebra of the topological space (X, —>■). These algebras were 
first introduced in ^2] in the context of combinatorial theory. 

The matrix representation of Rota algebras. Given the Rota algebra 
of a transitive graph X, its standard matrix representation is obtained by 
choosing the basis of Q consisting of the elements of the form \i)(k\ = e.^, 
with ik ranging over all tending pairs i — ► k of nodes of X. The matrices e^ 
(called matrix units) are defined as follows: 

, . f 1 m — i and n = k (provided i — > k) 
e*K") = ( otherwise (7) 

where ejfc(m, n) stands for the element in the m-th row and the n-th column 
of the matrix e^. We can also extend the ranging to all pairs of elements of 
X by putting e^ = for i y4 k. Then the product © reads: 



e ik e i'k' — 3ki> e ik> 



(8) 
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To specify a Rota algebra in the standard matrix representation we fix 
the template matrix replacing the unit entries in the incidence matrix 1^ of 
the graph X: 



by wildcards * ranging independently over all numbers. For instance, the 
algebra associated with the two-node graph 
• — has the the following template matrix 



So, we see that any transitive graph can be described in terms of a finite- 
dimensional algebra (for further details we refer to [2U]). 

Appendix B: Spatialization procedure 

Here we describe the spatialization procedure which associates a graph with 
an arbitrary finite-dimensional algebra. 

The emergence of nodes. Let us start with a given finite-dimensional 
associative (and non-commutative, in general) algebra Q. According to stan- 
dard conceptions and methods of modern algebraic geometry, as well as the 
general algebraic approach to physics, we introduce the points of Q as its 
irreducible representations (IRs). So, the first step of the specialization pro- 
cedure is creating (or finding) points of fl (which will become nodes of the 
future graph): 



Standard set with nonstandard topology. When the first spatializa- 
tion step in a standard way © is done we may wish to proceed by connecting 
the set of nodes by arrows. This problem is mathematically equivalent to 
equipping the set of points of the algebra by a topology. There are stan- 
dard recipes for this step as well like, say, the Zariski topology on the prime 
spectrum of Q. Unfortunately, on finite-dimensional algebras this topology 





{ points } = { IRs } 



(9) 



Superposed Quantum Networks 



12 



is always discrete, which leaves us no chance to fit the above requirement of 
being non-Hausdorff (ie, not T2). In terms of graphs that means that the 
standard recipes can not help us to create arrows. So, we are compelled to 
find another topology. For these purposes the Rota topology is suggested 
(first it was introduced in |20j). 

Let fl be a finite-dimensional algebra. Denote by X the set of points of 
fl, each of which we shall associate with a prime ideal in Q. Consider two 
points (representations of fl) i,j G X and denote by ker z, ker j their kernels. 
Both of them, being kernels of representations, are two-sided ideals in fl, in 
particular, subsets of fl, hence both of the following expressions make sense: 

ker i fl ker jcfl and ker i ■ ker jcO 

the latter denoting the product of subsets of fl: keri ■ ker j = {a G Q \ 3u G 
keri, v G ker j : uv = a}. Since keri, ker j are ideals, we always have the 
inclusion kerz ■ ker j C kerz fl kerj which may be strict or not. Define the 
relation oc on X as follows: 

i oc j if and only if keri ker j 7^ ker % fl ker j (10) 

Then the Rota topology is the weakest one in which i oc j implies the 
convergence i — > j of the point i to the point j. Explicitly, the necessary and 
sufficient conditions for i to converge to j in the Rota topology reads: 

i — > j if and only if 3k , ■ ■ ■ , h, . . . , k n \ k = i, k n = j; kt-\ oc k t (11) 

This operation is called the transitive closure of the relation oc. Note that, 
in general, the Rota topology can be defined on any set of ideals. 

It was proved by Stanley ^H] that in that particular case when fl is 
the Rota algebra of a reflexive transitive graph, then its spatialization X 
endowed with the Rota topology (fTTj) is homeomorphic to the initial topo- 
logical space. However, in general if we have two reflexive transitive graphs 
and an arrow-preserving mapping between them, their Rota algebras may 
not be homomorphic. Recently, 'good' classes of reflexive transitive graphs 
topological spaces were discovered for which the transition to Rota algebras 
is functorial |H1IS]- In our approach, it supports the existence of the classical 
limit. 
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